Serveur d'exploration sur la recherche en informatique en Lorraine

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Une nouvelle architecture de compensation du bruit pour la reconnaissance robuste de la parole

Identifieur interne : 006804 ( Main/Exploration ); précédent : 006803; suivant : 006805

Une nouvelle architecture de compensation du bruit pour la reconnaissance robuste de la parole

Auteurs : Khalid Daoudi ; Murat Deviren

Source :

RBID : CRIN:daoudi04a

English descriptors

Abstract

We present a novel noise compensation architecture which makes no assumptions on how the noise sources alter the speech data and which do not rely on clean speech models. Rather, this new architecture makes the (realistic) assumption that speech databases recorded under different background noise conditions are available. Its main principle is to process individually each database and to construct a parametric representation which describes the variation of acoustic models w.r.t. noise models. This representation is then used during recognition to estimate the acoustic models in the new environment. We evaluate the performance of this new compensation scheme on a connected digits recognition task and show that it can perform significantly better than multi-conditions training, which is the most widely used technique in these kind of scenarios.


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" wicri:score="69">Une nouvelle architecture de compensation du bruit pour la reconnaissance robuste de la parole</title>
</titleStmt>
<publicationStmt>
<idno type="RBID">CRIN:daoudi04a</idno>
<date when="2004" year="2004">2004</date>
<idno type="wicri:Area/Crin/Corpus">003D88</idno>
<idno type="wicri:Area/Crin/Curation">003D88</idno>
<idno type="wicri:explorRef" wicri:stream="Crin" wicri:step="Curation">003D88</idno>
<idno type="wicri:Area/Crin/Checkpoint">000632</idno>
<idno type="wicri:explorRef" wicri:stream="Crin" wicri:step="Checkpoint">000632</idno>
<idno type="wicri:Area/Main/Merge">006B07</idno>
<idno type="wicri:Area/Main/Curation">006804</idno>
<idno type="wicri:Area/Main/Exploration">006804</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Une nouvelle architecture de compensation du bruit pour la reconnaissance robuste de la parole</title>
<author>
<name sortKey="Daoudi, Khalid" sort="Daoudi, Khalid" uniqKey="Daoudi K" first="Khalid" last="Daoudi">Khalid Daoudi</name>
</author>
<author>
<name sortKey="Deviren, Murat" sort="Deviren, Murat" uniqKey="Deviren M" first="Murat" last="Deviren">Murat Deviren</name>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>noise robustness</term>
<term>speech recognition</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en" wicri:score="2655">We present a novel noise compensation architecture which makes no assumptions on how the noise sources alter the speech data and which do not rely on clean speech models. Rather, this new architecture makes the (realistic) assumption that speech databases recorded under different background noise conditions are available. Its main principle is to process individually each database and to construct a parametric representation which describes the variation of acoustic models w.r.t. noise models. This representation is then used during recognition to estimate the acoustic models in the new environment. We evaluate the performance of this new compensation scheme on a connected digits recognition task and show that it can perform significantly better than multi-conditions training, which is the most widely used technique in these kind of scenarios.</div>
</front>
</TEI>
<affiliations>
<list></list>
<tree>
<noCountry>
<name sortKey="Daoudi, Khalid" sort="Daoudi, Khalid" uniqKey="Daoudi K" first="Khalid" last="Daoudi">Khalid Daoudi</name>
<name sortKey="Deviren, Murat" sort="Deviren, Murat" uniqKey="Deviren M" first="Murat" last="Deviren">Murat Deviren</name>
</noCountry>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 006804 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 006804 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     CRIN:daoudi04a
   |texte=   Une nouvelle architecture de compensation du bruit pour la reconnaissance robuste de la parole
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022